Discovering novel subsystems using comparative genomics

Authors

  • Luciana Ferrer
  • Alexander Glennon Shearer
  • Peter D. Karp
Abstract

MOTIVATION: Key problems for computational genomics include discovering novel pathways in genome data and discovering functional interaction partners for genes, in order to define new members of partially elucidated pathways.

RESULTS: We propose a novel method for the discovery of subsystems from annotated genomes. For each gene pair, a score measuring the likelihood that the two genes belong to the same subsystem is computed using genome context methods. Genes are then grouped based on these scores, and the resulting groups are filtered to keep only high-confidence groups. Since the method is based on genome context analysis, it relies solely on structural annotation of the genomes. The method can be used to discover new pathways, find missing genes from a known pathway, find new protein complexes or other kinds of functional groups, and assign function to genes. We tested the accuracy of our method on Escherichia coli K-12. In one configuration of the system, we find that 31.6% of the candidate groups generated by our method closely match a known pathway or protein complex, and that we rediscover 31.2% of all known pathways and protein complexes of at least 4 genes. We believe that a significant proportion of the candidates that do not match any known group in E. coli K-12 corresponds to novel subsystems that may represent promising leads for future laboratory research. We discuss in-depth examples of these findings.

AVAILABILITY: Predicted subsystems are available at http://brg.ai.sri.com/pwy-discovery/journal.html.

CONTACT: [email protected]

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
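
The score, group, and filter steps described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how genes are grouped or how groups are filtered, so the sketch assumes thresholded connected components for grouping and a minimum size plus minimum mean pairwise score for filtering. The function name `group_genes`, all parameter values, the gene names, and the scores are hypothetical placeholders.

```python
from collections import defaultdict
from itertools import combinations


def group_genes(pair_scores, link_threshold=0.8, min_size=3, min_mean_score=0.5):
    """Group genes by pairwise score, then keep only high-confidence groups.

    Assumption: grouping is done with connected components over pairs whose
    score passes link_threshold. The paper may use a different procedure.
    """
    # Build an undirected graph that keeps only strongly linked gene pairs.
    neighbors = defaultdict(set)
    for (a, b), score in pair_scores.items():
        if score >= link_threshold:
            neighbors[a].add(b)
            neighbors[b].add(a)

    # Collect connected components with an iterative depth-first search.
    seen, groups = set(), []
    for start in sorted(neighbors):
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            gene = stack.pop()
            if gene in component:
                continue
            component.add(gene)
            stack.extend(neighbors[gene] - component)
        seen |= component
        groups.append(component)

    def mean_score(group):
        # Average score over all pairs in the group; unscored pairs count as 0.
        pairs = list(combinations(sorted(group), 2))
        total = sum(
            pair_scores.get((a, b), pair_scores.get((b, a), 0.0)) for a, b in pairs
        )
        return total / len(pairs)

    # Hypothetical confidence filter: minimum size and minimum mean score.
    return [
        sorted(g)
        for g in groups
        if len(g) >= min_size and mean_score(g) >= min_mean_score
    ]


# Placeholder scores, standing in for genome context scores.
scores = {
    ("geneA", "geneB"): 0.95,
    ("geneB", "geneC"): 0.90,
    ("geneA", "geneC"): 0.85,
    ("geneC", "geneD"): 0.30,
    ("geneE", "geneF"): 0.92,
}
candidate_groups = group_genes(scores)
print(candidate_groups)
```

With these placeholder scores, geneD is left out because its only link falls below the threshold, and the geneE/geneF pair is dropped by the size filter. Raising or lowering the thresholds trades the number of candidate groups against their confidence.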

Similar resources

Schistosoma comparative genomics: integrating genome structure, parasite biology and anthelmintic discovery.

Schistosoma genomes provide a comprehensive resource for identifying the molecular processes that shape parasite evolution and for discovering novel chemotherapeutic or immunoprophylactic targets. Here, we demonstrate how intragenus and intergenus comparative genomics can be used to drive these investigations forward, illustrate the advantages and limitations of these approaches and review how ...

Genomic filtering: an approach to discovering novel antiparasitics.

Genomic filtering is a rapid approach to identifying and prioritizing molecular targets for drug discovery. For infectious disease applications, comparative genomics filters allow the selection of pathogen-specific gene products, whereas functional genomics filters, such as RNA interference (RNAi), allow the selection of gene products essential for pathogen survival. The approach is especially ...

Comparative genomics of human stem cell factor (SCF)

Stem cell factor (SCF) is a critical protein with key roles in the cell such as hematopoiesis, gametogenesis and melanogenesis. In the present study a comparative analysis on nucleotide sequences of SCF was performed in Humanoids using bioinformatics tools including NCBI-BLAST, MEGA6, and JBrowse. Our analysis of nucleotide sequences to find closely evolved organisms with high similarity by NCB...

Discovery of virulence factors of pathogenic bacteria.

Discovering virulence factors of pathogenic bacteria is a key in understanding pathogenesis and for identification of targets for novel drugs and design of new vaccines. Comparative genomics, transcriptomics, and proteomics have become the popular tools in discovering the virulence factors in bacterial pathogens, such as Neisseria meningitidis, Yersinia pestis, Mycobacterium tuberculosis, and S...

Cross-species Comparison for Identifying Orthologous Simple Sequence Repeats of Disease Genes

Simple sequence repeats (SSRs) have been demonstrated to affect normal gene function to cause different genetic disorders. Several conserved and even partial functional SSR patterns were discovered in inherited orthologous disease genes. To explore a wide range of SSRs in genetic diseases, a system focuses on orthologous SSRs for disease genes through comparative genomics mechanism is construct...

Journal:
  • Bioinformatics

Volume 27, Issue 18

Pages: -

Publication date: 2011